-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactoring, arg list handling, add two parameters #11
Conversation
…ged_prot to 19_palm to reduce length of arglist; (3) move small families to the side in alignment step; (4) correct printing of header for counts per accession
…ommon.sh. Likely breakages. Incomplete.
… from previous runs
… in the summarize step
…ther minor cleanup
It looks like existing conf files need to be updated; the
https://github.com/legumeinfo/pandagma/actions/runs/8078741346 |
…ndagma-fsup.sh to work with the pandagma driver script and pandagma-common.sh
I think I've fixed at least those config omissions. |
From the results of the last workflow run: The new default workflow steps in The Glycine_7_3_2 and Vigna4_7_1_4 jobs completed, but encountered an OS "too many open files" error when running actions/upload-artifact; e.g.:
The previous successful run of Glycine_7_3_2 resulted in only 23 files in the out_pandagma directory (instead of 152583 files). The extra files are in the 21_hmm, 22_hmmalign, 23_hmmalign_trim2, and 24_trees directories. Are these needed in the out_pandagma for |
For the default workflows, I think we should NOT include these steps:
Omitting these should fix both the runtime and file-count problems. What determines which steps are included in "default" in the GitHub actions? |
The GitHub Actions workflow is running the default sequence of pandagma workflow steps run when the pandagma/.github/workflows/test.yml Line 57 in 3e642e4
|
OK. I'll pull the optional steps out of -pan and -fam (just removing them from |
…trees, and xfr_aligns_trees from commandlist in -pan and -fam workflows
…s are used if stats/ks_histplots.tsv isn't provided
@nathanweeks Ready for review again. I tested the |
GA workflow passed. |
Many changes in this branch.
ARG_MAX
is 2097152 on both Ceres and NERSC/Perlmutter, but I saw this error only when running on Perlmutter). My fix was simpleminded, using pushd/popd to remove repeated inclusion of directory names. There might be better solutions (xargs
or the like), but I think my fix should be OK for the time-being.I have tested the changes on two large (and pretty costly) production runs:
pan
for 57 Glycine annotations andfam
on 25 species.